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ABSTRACT 

In this paper we propose a parallel approach to quantifier 
elimination on real closed fields through a modification in 
the cylindrical algebraic decomposition (CAD) algorithm to 
accommodate parallelism. In this approach the speed-up 
due to parallelism obtained on an input formula is a prop- 
erty of that formula itself. Hence the time complexity of 
algorithm depends on not just the size of the input but on 
the input self. We classify prenex formulae depending on 
the amount of input-based parallelism that can be obtained 
from it based on which we obtain some insights into the 
complexity of the algorithm. We demonstrate performance 
of the proposed algorithm with experimental results. 

Keywords 
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1. INTRODUCTION 

The study of real algebraic geometry deals with the study 
of the real roots of an equation as, more often than not, 
the real roots are the most desired solutions to a system of 
equations. It is important to note that for some problems 
there are no counterparts in complex algebraic geometry. 
Three such examples are: (i) Hilbert's 16 th problem, (ii) 
Hilbert's 17 th problem, and (iii) Tarski-Seidenberg principle. 

Given a first-order formula with both quantified and quan- 
tifier - free variables, the process of finding an equivalent 
first-order formula in these quantifier-free variables is called 
quantifier elimination. When the first-order formula is a 
boolean combination of polynomial equations and inequali- 
ties, we can consider quantifier elimination as a problem in 
real algebraic geometry. 

The algorithm specified by Tarski [12] for quantifier elim- 
ination is highly resource intensive. So newer and more ef- 
ficient algorithms have come up and replaced it. The fact 
that a semialgebraic set can be divided into a finite number 
of semialgebraic sets (Tarski-Seidenberg principle) has led to 
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Cylindrical Algebraic Decomposition (CAD) which even- 
tually became the standard algorithm for quantifier elimi- 
nation over real closed fields. The time complexity of CAD 
algorithm is doubly exponential in the number of variables 
(both quantified and quantifier- free) . 

It is a recursive algorithm, which has a sequence of pro- 
jections, followed by a sequence of constructions, leading to 
the solution. CAD has undergone an extensive array of de- 
velopments, such as Hong's Partial CAD [5], Hong's projec- 
tion operator [9], Scott McCallum's projection operator [T], 
application of Grobner Bases to CAD [3| and a first step 
towards parallelism in CAD 
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This makes quantifier elimination through CAD impossi- 
ble for a wide range of real-world applications. The goal of 
the paper is to address this issue and expand the class of 
problems where CAD could be used to perform quantifier 
elimination using a reasonable amount of resources. 

The present work builds on 11 and extends the idea of 
parallelism to the entire algorithm. We show that paral- 
lelism can be introduced in various aspects of CAD. They 
are, (i) input to CAD can be made into independent units, 
(ii) projections can be made parallel, and (iii) constructions 
can be made parallel to the maximum possible extent. 

The input to CAD cannot always be split into independent 
units, because of the constraints imposed by first-order logic 
itself. This motivate us to introduce Skolemization to elim- 
inate the existential quantifiers. An ad-hoc improvement to 
these schemes is the introduction of chains of substitutions, 
which have the potential to reduce the number of variables 
in some polynomials, keeping the final truth value constant. 
These schemes benefit from the fact that speed-up happens 
at a doubly exponential rate. Reduction of just one variable 
reduces the time complexity greatly. These improvements 
are illustrated in the experimental results. 

The paper is organized as follows. The preliminary nota- 
tions and the original CAD algorithm are briefly introduced 
in Section 2. The idea of separability and the equi-satisfiable 
properties of substitutions are described in Section 3. The 
parallel CAD algorithm itself is described in Section 4. Em- 
pirical results for separability and substitutability are de- 
scribed in Section 5. A few directions sought for future re- 
search are mentioned in Section 6. 

2. PRELIMINARIES 
2.1 Basic Notation 

Let k[ ] denote the set of all polynomials in vari- 

ables, x\...x n and coefficients from the field k. The base 



field, k is assumed to be the set of real numbers, R through- 
out the paper. A field, k is called algebraically closed if every 
polynomial, p G k[xi,a;2, ...x n ] with coefficients from k has 
a root in k itself. The best known example is the set of 
complex numbers, C. 

Definition 2.1. A formula of the form 

(QiXi)...(Q n x n )[ip(y 1 ...y m , zi. ..£„)] 

is called a prenex formula, where Qi G {3, V} and tp is a 
quantifier-free formula in xi, ...x n , yi, ...y m - 

Definition 2.2. V C R is said to be a semialgebraic set 
if 3/i.../ s G R[a;i...a;„] and g\...gt G R[a;i...a; n ] such that 
V = {(ai...a n ) G R : /i(ai...a„) = 0, i = I...S, g ] {a 1 ...a n ) > 
0,i = 1...*}. 

A connected subset of R n is called a region in R" . Con- 
sider a region R and functions /i : R" — > R satisfying, 
/i < /2 < •■• < This ordering ensures that these func- 
tions do not intersect each other. A fi-section is the graph 
of the function ft. In other words, it's the set containing all 
points of the form (a, b), where a G R n_1 and 6 = /(«)■ An 
(f,fi+i)-sector is the set of all points (a,b), where a G R n_1 
and fi(a) < b < fi+i(a). A cylinder over a region R is the 
set of all points (a, b), where a G R and b G R. A stack over 
a region i? is the collection of sections and sectors that occur 
in the cylinder over R. A partition of R" into semialgebraic 
partitions is called an algebraic decomposition of R n . 

Definition 2.3. A partition o/R n satisfying the follow- 
ing two properties is called a cylindrical algebraic decompo- 
sition: 

1. If n = 1, then the CAD is a set of points and open 
intervals. 

2. If n > 1, every region of the CAD o/R™ _1 has a stack 
over it, which is a disjoint subset of some of the regions 
of the CAD of R n . 

A set of polynomials, F is said to be sign invariant on the 
region R iff no polynomial in F changes its sign anywhere 
on R. 

Definition 2.4. A CAD of R n in which each region is 
sign-invariant with respect to an input set of polynomials is 
called a sign-invariant CAD. 

The CAD algorithm has two phases in its description, 
namely projection and construction. Projection is specified 
by a projection operator. 

2.2 Cylindrical Algebraic Decomposition Al- 
gorithm 

This algorithm takes as input, a set of polynomials in n 
variables and generates a cylindrical algebraic decomposi- 
tion of the n-dimensional real space. This algorithm works 
recursively to produce a series of projections on lesser dimen- 
sions and builds the CAD of each dimension while returning 
the recurring functions. An outline of CAD algorithm 17] is 
presented below. 



polynomials in n — 1 variables is obtained as output, 
upon applying a projection operator. This process is 
continued till a set of univariate polynomials is ob- 
tained. 

2. These univariate polynomials can easily be solved (us- 
ing Sturm's theorem iteratively or otherwise). The 
roots of these univariate polynomials and the intervals 
between them are taken as the regions in the CAD 
of one dimensional real space. Designate a randomly 
selected point in each region as a sample point. 

3. For the two dimensional CAD, substitute the sample 
point found in each one-dimensional region, R in each 
of the polynomials in the 2-dimensional projection to 
again get univariate polynomials. Find the roots of 
these polynomials and create two dimensional regions 
that form the stack over the region R. Designate sam- 
ple points for these regions as well. Suppose we are 
given a CAD for fc-dimensional real space, we use the 
projected set for k + 1-dimensions and substitute the 
sample points of the fc-dimensional CAD in them to 
get univariate polynomials. These can now be solved 
to get k + 1-dimensional CAD. 

Now, this information can be used by another recursive al- 
gorithm to perform quantifier elimination [6]. Assume that 
the variables Xi,...,Xk are quantifier-free and x k+l ,...,x n 
have quantifiers, and let Qi be the corresponding quantifier 
for i 6 {k + 1, n}. Now we list the QE algorithm. 

QE(( J D 1 ,...,D„),(F 1 ,...,F ; )) 

Input: The CADs o/R 1 to R" (Di,...,D n ) and the formulae 
describing the regions of the CAD o/R fc Fi). 
Output: The quantifier-free formula equivalent to the input 
prenex formula. 

1. For k < i < n, we have 

(a) If Qi+i is 3, then C £ ft is valid if at least one 
region in the stack over C is valid. 

(b) If Qi+i is V, then C G Di is valid if all regions in 
the stack over C are valid. 

2. A region, C in D n is valid if 4>(tc) is TRUE, where tc 
is the sample point of that region. 

3. Obtain the cells of D k which are TRUE. The dis- 
junction of the formulae of these cells is the required 
quantifier-free formula and is returned. 

The projection operator takes as input a set of polyno- 
mials in n variables and returns another set of polynomials 
in n — 1 variables. Assume PSRCj(P,Q) denotes the 
principal subresultant coefficient of P and Q. 

Collins' projection operator takes as input the polynomi- 
als A = {Ai, A m }. Then it returns (i) The coefficient of 
each and every term of each Ai, (ii) PSRd(Uj , U'j) for ev- 
ery i and j and, (iii) PSRd(Uj, [/*,) for every i and j < k. 
Ui represents every element in the reducta set of Ai. 



1. Project each polynomial onto lesser dimensions. Poly- 
nomials in n variables are taken as input and a set of 



3. SOME OBSERVATIONS THAT LEAD UP 
TO PARALLELISM 

3.1 Separability 

The concept of separability is used when we need to con- 
vert one instance of quantifier elimination into multiple in- 
stances of quantifier elimination. 

Definition 3.1. For prenex formulae, f and g , if we have, 

d(xi,...X n ) = (Q r X r )...(QnXn) f (x fl , . . . , X f k ) C g (x gi , X gi ), 

where Qi G {3,V},Vi G {r, ...,n} and C G {V,A}, we say 
that g is separable from d if , supposing F — {xf 1 , Xf 2 , x/ k } 
and G = {x gi ,x g2 , . . . , x 9l } and H = F n G and 

Fl = {^/ll , Xh 2 3 •■•5 Xhj } ; 

1. C is V, t/ien every x £ H should not be the universal 
quantifier, V. 

2. C is A, then every x G H should not be the existential 
quantifier, 3. 

3. g does not evaluate to TRUE or FALSE upon quanti- 
fier elimination, and g has quantifier-free variables. 

Essentially, we would be converting 

(Qk+ix k+ i)...(Q n x„)[tp(xi...x„)], Qi G {3,V} 

into two (or more, if recursively applied) quantifier elimina- 
tion problems, 

X CY, 

X = {Qk+iXk+i)---(QnX n )[tp'(x 1 ...x„)] and 
Y = (Q k+1 x k+1 )...(Q n x n )[ip"(x 1 ...x n )] 

Q l G {3,V},CG {A,V} and 
^'(x 1 ...x n )C^"(x 1 ...x n ) = ip(x 1 ...x n ). 

The theory of separability seeks to address the conditions 
on ip' and tp", to allow such a splitting. 

The projection and construction phases of CAD algorithm 
are inherently sequential. A projection (construction) of 
a set of polynomials in k variables happens only after the 
corresponding set of polynomials in k + 1 dimensions (k — 
1 dimensions for constructions) has been projected (con- 
structed). Hence, none of the major phases of the original 
CAD algorithm can be executed in parallel. We can sep- 
arate the input, leading to multiple instances of the CAD 
algorithm with lesser number of variables. 

Note that the concept of separability is derived from the 
following two well known properties of quantifiers: 

1. (Vx)[/(z) A g(x)] (VaO/(x) A (Vx)g(x). 

2. (3x)[f(x) V g(x)} o (3x)f(x) V (3x)g(x). 

In the above two cases, we effectively separated f(x) ( or 
g{x)) from the formula on the left side. We now need to 
generalize this notion of separation to an n variable case. 

Theorem 3.2. Assume that 

d{x 1 ...x n ) = (Q r x r )...(Q n x n )f(x fl ...x fk )Cg(x gi ...x gi ) 

where C G {V,A} and Qi G {V, 3}, is a prenex formula. If 
f is separable from g and 

X = (Q r X r )(Q r + 1 X r + 1 )...(Q n X n )[f{x fl ,X h ,...,X fk )] 



and 

Y = (Q r X r )(Qr+lXr + l)...(Q n Xn)[g(x gi ,X g2 , —,X gi )] 

then, 

d(x u ...x n ) = X CY. 

Proof. Suppose F — {xf 1 ,xf 2 , Xf k } and G = {x gi , x g2 , x gi } 
and H — F n G and H — {xh 1 , x^ 2 , xi lj }. Then we have 
two cases, based on C: 

1. C is V: The definition of separability gives us that 
g can be separated if every x G H does not have the 
universal quantifier. We know that d is satisfied if 
we choose values for G — {x gi , x g2 , x gi } in such a 
way that g is satisfied. Some variables, namely H = 
{Xhi }, are common between / and g, and 
do not have the universal quantifier. If we are able 
to choose values for the variables in G or F to satisfy 
d, we implicitly chose values for the variables in H to 
satisfy d. This choice is feasible due to the absence of 
universal quantifiers for the variables in H. 

2. C is A: This is the dual of the previous case. 

□ 

3.2 Parallelism in Projections and Construc- 
tions 

Projections and constructions form the recursive steps of 
the CAD algorithm. So, it's natural to ask whether these 
can be parallelized. A step in this direction has been taken 
by Saunders et al. [Tl]. This paper proposed a way to find 
parallelism in the construction phase. We borrow this idea, 
suitably modified, and call it 'Pipelined Constructions'. A 
recurring feature in most of the projection operators is the 
fact that they deal with polynomials pair-wise. This can be 
utilized to work in a parallel fashion. 

The projection operators utilize polynomials pair- wise to 
compute resultants and discriminants. This can be done in 
a parallel fashion, using the divide and conquer paradigm. 
Assuming there are n polynomials in r variables, we could 
assign ^ of them to one processor and ^ to another pro- 
cessor. This goes on recursively till we get each proces- 
sor working on just two polynomials. These now generate 
the discriminants and resultants simultaneously. Assuming 
we have found the projections of X = {oi,a2,--i a -} an d 
Y = {ai +ll ...,a„} separately, we now combine them by 
pairing up each polynomial from X with every polynomial 
from Y. This procedure ensures that projections are done at 
effectively at a logarithmic function of the time taken by the 
original projection operator. This way of projecting shall be 
referred to as Parallel Projections. 

Construction phase parallelism depends on many issues. 
Clustering prevents any parallelism from entering construc- 
tion phase as some, if not all the regions would be inter- 
dependent. Hence, we consider a case without clustering. 
This idea has been explored in [Tl] . Suppose the n-dimensional 
CAD has regions, R = {n, r%, r^}. In the construc- 
tion stage, the stack over each region is constructed iter- 
atively. Suppose the stack over the region n has regions, 
R' = {riuTia, ...ri,}. The construction of the stack over ri 
is followed by the construction of the stack over the region 
T2- The observation to be made is that the construction of 
the stack over ri is independent of the construction of stacks 



over the regions of R' . These can be done in parallel. This 
is analogous to the stages in a CPU pipeline. Hence, this 
work shall refer to this process as Pipelined Constructions. 

3.3 Substitutability 

Most real-world problems have more than one polynomial 
in their prenex formulae, connected by boolean connectives. 
The observation to be made here is that, in most cases, none 
of these polynomials have all the variables that are used in 
the whole formula. The time complexity of the CAD al- 
gorithm has an upper bound of (mn) k d k , where m is the 
number of polynomials occurring in the input formula, r is 
the total number of variables, n is the maximum degree of 
any polynomial in any variable, k is some constant and d 
is the length of any integer coefficient of any polynomial. 
We observe here, that in general, if we could decrease r by 
even one variable, but add more number of polynomials to 
m, we'd get a faster running algorithm. There are instances 
where the elimination of one variable makes a problem com- 
putable in reasonable amounts of space and in reasonable 
time. But, there's no closed form for the roots of quin- 
tic and higher degree equations, as proved by Niels Henrik 
Abel. Hence, we can use only the variables, which have a 
degree of at most 4 to perform substitutions. 

In our definition of substitutability, we classify polyno- 
mials into two classes, namely (i) substitutor and (ii) sub- 
stituend. 

Substitutors are further sub-divided into two types, namely 
(i) substitutor-type I and (ii) substitutor-type II. 

Definition 3.3. A substitutor-type I polynomial should 
form an equation and has to contain atleast one variable 
whose exponent is at-most 4- 

Definition 3.4. A substitutor-type II polynomial forms 
an equation and has a variable of exponent at-most 4 in one 
of its factors, and is not a substitutor-type I. 

Definition 3.5. A substituend polynomial does not come 
under the substitutor-types I and II. 

Definition 3.6. Assume that P is any polynomial and T 
is a substitutor-type I. We say that T is substitutable in P 
if, the process of replacing a variable in P by substituting it 
with other variables from T does not result in an increase in 
the number of variables in the polynomial P. 

Using the above defined terms and operation, we can now 
state the following lemma, which essentially states that sub- 
stitutability produces an equi-satisfiable formula for a given 
prenex formula. We denote equi-satisfiability by the symbol 
in the following lemma. 



Proof. For any given n-tuple, (xi, x n ), g(xi,...,x n ) 
is either true or false. If it were true, 

f(xi,...,X n ) f'(xi, ...,X r -l,X r+1 , ...,X n ) 

. If it were false, g(xi, ...,x„) A f'(xi, x r -i, av+i, —,x„) 
and h(xi,...x n ) would both be false. Hence, h(x\, ...x n ) 
and f'(xi, ...,Xr-i,x r +i, ...,x n )/\g(xi, ...,x n ) have the same 
truth table, and hence are equi-satisfiable. □ 

4. PARALLEL CYLINDRICAL ALGEBRAIC 
DECOMPOSITION 

In this section we present a parallel CAD algorithm that 
is motivated by the observation we made above. 

4.1 Preliminary Considerations 

Skolemization is a process discovered by Thoralf Skolem [8] , 
which replaces all existentially quantified variables by new 
functions, called Skolem functions, which depend on the 
presently quantified variables. This leaves only the univer- 
sal quantifiers intact. When Skolemization is performed on a 
CNF formula, each term of the CNF becomes separable from 
the rest of the formula. This allows each term of the result- 
ing CNF to be a separate parallel process. But, the newly 
introduced Skolem functions should also be accounted for, 
in the input. This may add to the number of variables, thus 
increasing the running time. 

If separability was done before substitutions, we would 
not be considering the terms that would've become separa- 
ble if some variables have been substituted for others. Thus, 
in the algorithms that follow, substitution is done prior to 
separations. This also derives from the fact that separabil- 
ity depends exclusively on the variables, and substitutability 
never adds new variables to an existing formula (it can only 
remove variables and replace them with expressions contain- 
ing other variables that occur in that formula). 

The proposed algorithms depend on the following defini- 
tions of Conjunctive and Disjunctive Normal Forms, 

Definition 4.1. For polynomial equations and inequali- 
ties fij for some i,j, a prenex formula of the type, 

[(/nA/ 12 A.../ lfel )V(/ 21 A/ 22 A.../ 2fc2 )V...V(/„iA/„ 2 A.../ nfc J] 

is called a disjunctive normal form, and each (fn A fa A 
■ ■■fiki), for some i is called a 'term'. 

Definition 4.2. For polynomial equations and inequali- 
ties fij for some a prenex formula of the type, 

[(/nV/ 12 V.../ lfel )A(/ 21 V/ 22 V.../ 2fc2 )A...A(/ nl V/„ 2 V.../ nfc J] 

is called a conjunctive normal form, and each (fn V fa V 
■■■fiki), f or some i is called a 'term'. 

4.2 Proposed Algorithms 

We now list the proposed algorithms. 



Lemma 3.7. Assume a prenex formula, 

h(xi, ...x„) = f(xi, ...,x r , ...,x n ) A g(xi, ...,x n ), 

where f and g are both prenex formulae. Then, 

h(xi,...x n ) — , X r - 1 ,X r +l, x n ) A g(xi , ... , x n ) , 

where the polynomial /' is obtained by substituting x r in f , 
using g. 



PARALLEL-CAD (A) 

Input: A set of polynomials, A 
Output: The CAD for R 1 to R n , with respect to A 

1. Perform parallel projections to A to obtain Ai in n — 1 
variables. 



2. Project the polynomials in the set At to get the set 
Ai+i . This continues till we obtain the set A n -\, which 
consists of univariate polynomials. 

3. Perform pipelined constructions on the set of projected 
polynomials, {Ai, A n -i}. We obtain the decom- 
positions, D\. ...,D n from these constructions, corre- 
sponding to the CAD for R 1 to R", with respect to 
A. 

4. Return Di, D n . 

This algorithm is called simultaneously for each set of 
polynomials, derived from separations, after performing sub- 
stitutions. These are described in the following algorithm. 

PREPROCESSOR(f) 

Input: A prenex formula containing a boolean combination 

of polynomial equations and inequalities, / 
Output: Sets of polynomials, Si, Sk, where each Si is a 
set of polynomials. 

1. Consider the prenex formula, /. Convert / to a CNF, 
c, or to a DNF, d, whichever has the most number of 
terms. If it was converted to a DNF, complement it 
twice to get a complement of a CNF. Denote this CNF 
by c. 

2. Identify the substitutor-type-I polynomials in c. Use 
them to substitute in all other polynomials and also 
among themselves. Retain the substitutor polynomi- 
als, which weren't substituted in, as part of the CNF. 

3. Consider all substitutor-type-I polynomials as substitutor- 
type-II polynomials. Factorize them and substitute 
among the factors, along with other substitutor-type- 

II polynomials. The resulting formula shall be denoted 
by r. 

4. Perform Skolemization on r to get r s . Perform sepa- 
rations on r to get the prenex formulae, n,...,ri. If 
every term of r s has at most as many variables as the 
number of variables in max(ri, r;), use r 3 for fur- 
ther computations, and denote it by r. Otherwise use 
r. 

5. Assume r gives rise to k prenex formulae when sepa- 
rated. Collect these polynomials in sets, Si,...,Sk- 

6. Return the sets Si, Sk- 

The algorithm, PARALLEL-CAD (A) is executed for each 
A G {Si, Sk}- These k instances of Parallel-CAD are run 
simultaneously. Quantifier elimination can be done by con- 
sidering the connectives and the equalities and inequalities 
specified among the member polynomials of each set, Si, 
i £ {1, ...,&}. The resulting k quantifier- free formulae are 
united using conjunctions. 

4.3 Analysis of the Algorithms 

In the following sections and subsections, 'structure' in a 
prenex formula refers to the dependence of the constituent 
polynomials on each other, resulting in conditions/constraints 
being imposed on the formula because of these polynomials. 

For the analysis of the proposed algorithms, we use the 
classical notion of parallel complexity [To] . The parallel com- 
plexity analysis of PARALLEL-CAD can be studied using 
three parameters, 



1. Number of Processors: The number of processors is 
assumed to be constant, denoted by an upper bound 
of K. This assumption is very rigid as it does not allow 
the increase in the number of processing cores. This 
way, we can study the effects of parallelism in a case 
where we obtain more number of partitions than the 
number of processors. This leads to an optimization 
problem. 

2. Parallel Time Complexity: Assuming that we have a 
collection of independent parallel processes, the time 
taken by the longest process is called the parallel time 
complexity of the collection. A formula is separated 
only when none of the resulting partitions have all 
the variables present in the original formula. This 
ensures that, when separability happens, the paral- 
lel time complexity is exponentially lower than the 
original CAD algorithm. This reduction in time com- 
plexity happens even though the total work done by 
PARALLEL-CAD is greater than the original algo- 
rithm. 

3. Total Work Done: Given a collection of parallel in- 
dependent processes, the amount of work done by all 
the independent processes combined is called the total 
work done by the collection. Equational constraints 
in a prenex formula eliminate the need for many con- 
structions, thus speeding up CAD. But, if these con- 
straints were to be separated, then the resulting prenex 
formula without the constraints would perform unnec- 
essary constructions, thus increasing the resource con- 
sumption. This would also occur with inequalities. 

Separability is a property of the prenex formula. Separa- 
bility might result in the loss of structure in a given prenex 
formula. By this we mean that the interdependencies among 
the formula's constituent polynomials are negated, result- 
ing in wasteful and redundant computations, which would 
not have been required if the formula was not separated. 
We need a formal, quantitative definition of 'structure' in a 
given formula to study it's effects on the running time of the 
proposed algorithm. 

Definition 4.3. The 'sharing factor' existing among two 
given prenex formulae, f and g, is the number of variables 
that are shared between f and g. It's denoted by Tf, g . 

The sharing factor essentially captures the distribution of 
variables among the constituent polynomials of a prenex for- 
mula. If formulae / and g have k variables in common, it 
implies that / imposes conditions on g in k dimensions. The 
sharing factor gives a measure of structure present in a for- 
mula, as the sharing of variables between polynomials causes 
interdependencies among them. Having defined the sharing 
factor, we now define the operation, separation, where the 
sharing of variables comes into consideration. 

Definition 4.4. Consider a prenex formula, f , which has 
n variables, out of which, there are k quantified variables 
and n — k quantifier- free variables. We say that the sepa- 
ration of f is the ordered pair (F, C), if we obtain a set of 
prenex formulae, F — {/i, .../;} and a set of boolean oper- 
ators, C — {ci, ...,c;_i}, such that the following conditions 
hold. 



1- f 



/lCl/ 2 ...Ci_l/i 



Assume that fj = qjfj,j G {1,...,/}, where qj is the 
string of quantifiers and quantified variables, Q\X\...Qj k i 
Qi G {3,V}, i G {1, k}, and fj is the formula with- 
out the string of quantifiers. The condition for (F, C) 
to be a separation of f is, 



liXi. 



}kXk[f[cxf' 2 Cl...Cl- 1 f' l \. 



Loss of structure in a formula leads to redundant compu- 
tations. 

Definition 4.5. Consider a prenex formula, f and it's 
separation, (F,C). While executing PARALLEL-CAD on 
the set F, a computation is said to be redundant if it need not 
be performed while executing the original CAD algorithm on 

dundant computation 
total computations 



f. The redundancy ratio is given by ™dundant computations 



The definition of separation defines separability as a prop- 
erty of the formula. In the remainder of this section, we as- 
sume that all prenex formulae are in variables, xi, ...x n . It 
is natural to classify prenex formulae into classes based on 
the number of partitions it can be separated into. Thus, we 
have the following definition. 

Definition 4.6. A prenex formula is defined to be k - 
separable if and only if it can be separated into k prenex 
formulae, which are connected by boolean connectives. The 
set of all k-separable formulae shall be denoted by Sk ■ 

According to this definition, we can observe that Sk C 
Sk-i- The set of 1-separable formulae is the set of all prenex 
formulae, which shall be denoted by P. We assume that we 
have only a constant number of processors, and we denote 
this by K. This leads to a definition of the class of separable 
problems. 

Definition 4.7. Consider a set S C P. S is called the 
'separable class ' if the following conditions hold. 

1. S C S 2 

2. If f is a prenex formula and {F,C) is it's separation, 
V/ G F, 3xi G {xi, ...,x n } such that f is independent 
ofx,. 

The speed-up obtained by a k-separable formula depends 
on the distribution of variables among the k separated for- 
mulae and their interdependencies. Thus, we need to define 
a subset of Sk , which has the most ideal distribution of vari- 
ables. 

Definition 4.8. The centre of Sk is the set of all prenex 
formulae in n variables, which can be separated into k for- 
mulae, fx, ...fk, each with ? variables and every variable is 
present in exactly one formula in fi,-.,fk- The centre of 
Sk is denoted by Ck- 

4.3. 1 PARALLEL-CAD and Separable Class 

The above defined terms help us in analysing the extent of 
improvement in running time and space a problem can pos- 
sibly achieve when PARALLEL-CAD is used. The notion 
of the centre of separable class indicates the areas where the 
maximum amount of speed-up is possible. We now define 
the conditions which determine whether a given problem is 
closer to the centre than another problem or not. Denote 
the number of variables in a polynomial, p, by 



Definition 4.9. Given two k-separable prenex formulae 
f and g and their separations, (F,Ci) and (G, C2), we say 
that f is closer to the centre than g ( denoted by f < g) if 

1. maxj < maxj /» G F and gj G G, 1 < 
i,j < k. 

2. //max, #(/;) = max, then^ T ftJj < £ m , ; T 3m , gi 
, fi, fj £ F and g m ,gi G G, 1 < i, j, m,l < k. 

In the above definition, g is said to be farther away from 
the centre, compared to /. 

The time complexity of CAD algorithm is bounded above 
by (mn) k d k , where m is the number of polynomials occur- 
ring in the input formula, r is the total number of variables, 
n is the maximum degree of any polynomial in any variable, 
k is some constant and d is the length of any integer coef- 
ficient of any polynomial. This motivates the definition of 
comparability. 

Definition 4.10. Consider two prenex formulae, f and 
g. The maximum degrees of any polynomial in any variable 
in f and g are denoted by n\ and n 2 ■ The number of vari- 
ables in f and g are denoted by r\ and r 2 . Without loss of 
generality, assume that r± > r 2 . Consider a function, a : 
N x N — > N, such that, a(n, r) = (mn)'' d k . The formulae, 
f and g are said to be comparable if a{n\,r\) > a(n 2 ,r 2 ). 

We now characterize the relationship between PARALLEL- 
CAD and the separable class. If the first condition in the 
definition 4.9 is satisfied, then the formula closer to the cen- 
tre has fewer variables than the farther one, which might it 
execute faster (if both the formulae are comparable). On 
the other hand, if the second condition is satisfied, then a 
lesser amount of structure is lost on separations, thus neces- 
sitating a lower number of redundant computations. This 
motivates the following observation. 

Observation 4.11. Given prenex formulae, f and g and 
their separations, (F,Ci) and (G,C 2 ), if f ^ g, then PAR- 
ALLEL - CAD takes at least as much time and has at least 
as much redundancy ratio to execute g as it would take to 
execute f, provided that f and g remain comparable. 

5. SIMULATION RESULTS 

The simulation results have been divided into two cate- 
gories which are mutually independent, namely (i) The cases 
where only separations are considered and (ii) The cases 
where only substitutions are considered. 

These experiments were run on an Intel i7 quad-core CPU 
using 4GB of DDRAM. QEPCAD Version B 1.65, 10 May 
2011 [2] is used to perform the computations. In the follow- 
ing plots, time is in milliseconds and space is in cells used 
by QEPCAD. 

5.1 Simulation Results for Separability 

The simulation has been performed for two cases, depend- 
ing on the distribution of variables among the constituent 
polynomials of the prenex formula. This is based on the 
fact that the running time and the space required for CAD 
depend almost entirely on the number of variables, the time 
being doubly exponential in the number of variables. 



Case Study 1 - Homogeneous Distribution of Variables 
This case study involves four prenex formulae, each with 
two polynomial equations and with varying number of quan- 
tifier - free variables being common between them. The 
space and time analyses of this situation yields quite in- 
teresting consequences, when separability is applied. The 
polynomials considered were: 



1. (3a;) [ax 2 + bx + c = V dx 2 + ex + f = 0] 



2. (3a;) [ax 2 +k + c = 0V dax 2 + ex + f = 0] 
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3. {3x)[ax 2 + bx + c = V dax 2 + ebx + f = 0] 



4. (3x) [ax 2 + bx + c = 0W dax 2 + ebx + fc = 0] 
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■ Space with separations 



The time complexity reduced nearly five-fold in all the 
four cases. The space complexity remained same across all 
the four cases if CAD was applied without separations, as 
there would still be the same number of projection opera- 
tions. But, when separations were applied, the space re- 
quired increased exponentially as the overlap in the variables 
of the two polynomials increased. This was expected, con- 
sidering the doubly exponential nature of CAD algorithm. 
Case Study 2 - Heterogeneous Distribution of Variables 
This case study considers a similar system as above, ex- 
cept for the fact that the variables are unevenly distributed 
among the two polynomials and that there is a disparity 
among the degrees of the two polynomials. The formulae 
considered are: 



1. (3a;) [da fx 5 + ebx 3 + ec = V fc + x = 0] 



2. (3x)[dafx 5 + ebx 3 + ec = 0V/ + 2; = 0] 
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Here, due to separations, the structure imposed by the 
second equation on the first one is lost, thus leading to pos- 
sibly more number of constructions. But, as the dependency 
reduces, the 'amount of structure' lost is also reduced, and 
hence we observe that the separated case starts to perform 
better. 

Case Study 3 - Validity of Observation 1^.11 Two case 
studies have been performed demonstrating the validity of 
observation 4.11. The inputs for the two case studies were 

1. • 3x[ax 5 + abx 3 + be = 0] V [dx + ef = 0] V [gfx 10 + 

hgx 6 + ix i + j = 0] 

• 3x[ax 5 + abx 3 + be = 0] V [dx + e = 0] V [gfx 6 + 
hix + j = 0] . 

2. • 3x[ax 3 + bx + c = 0] V [cdx 2 + de = 0] 

• 3x[ax 3 + bx + c = 0] V [dx 2 + de = 0]. 

The time taken for execution and the space required were: 







First Formula 


Second Formula 


Case 1 


Time 


1584ms 


2128ms 




Space 


15126564 cells 


23304694 cells 


Case 2 


Time 


40ms 


36ms 




Space 


73998 cells 


66287 cells 



3. (3a;) [da fx 5 + ebx 3 + ec = V x = 0] 



In the first case, the time taken by the first formula was 
considerably lesser, even though it had a larger exponent in 
it's constituent polynomials. This is because of a more even 
distribution of variables. According to observation 4.11, the 
first formula is closer to the centre of 5*3 than the second 
formula. 

In the second case, the second condition of closeness to 
the centre is considered. Thus, the time consumed by both 
the formulae are almost identical. Due to sharing of vari- 
ables in the first greater amount of wasteful total 
work has been done, than in the second case. According to 
observation 4.11, the second formula is closer to the centre 
of S2 than the first one. 



5.2 Simulation Results for Substitutability 

The experiments have been done for 4 test cases. They 
do not dynamically schedule new instances to different cores. 
The 4 cases are, 

1. (\/x)(\/y)[[b 2 x 2 + a 2 y 2 = a 2 b 2 } [x 2 + y 2 < 1]] 

2. {3y)(3z)[[x 3 y - 29x 2 = 0] A [x 4 z 2 - 101x 3 - 19x = 
0] A [x 2 + y 2 + z 2 > 0] A [a; + y > 0] A [y + z < 0]] 

3. (Vu)[[x 3 y - 29w 2 = 0] A [x 4 z 2 - lOlw 2 - 19v = 0] A 
[a; 2 + y 2 + z 2 > 0] A [x + y > 0] A [w + v + x + z > 0]] 

4. (3v)[[x 5 + 4y 2 - 3z = 0] A [a; 6 + 2w 2 - v = 0] A [x + y + 
w + z + v > 0]] 

This simulation is intended to demonstrate the extent 
till which substitutions can improve the time and space re- 
quired. 
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In the above plots, we observe that the space required for 
computations decreased to a greater extent, as compared to 
the time. This is because of the fact that substitutions pre- 
serve the inter-dependency among the constituent polyno- 
mials in a prenex formula, unlike separations. Substitutions 
only change the way these polynomials are represented. This 
explains the difference between the space and time plots of 
the separations case and the substitutions case. The plots 
indicate that the space consumed and the time elapsed are 
disproportionate, suggesting that a large amount of memory 
has been used on projecting and constructing operations, 
which required small amounts of processing time. 

6. CONCLUDING REMARKS 

In this paper we proposed a parallel Cylindrical Algebraic 
Decomposition algorithm. The ongoing work is to imple- 
ment this algorithm for execution on a GPU. Based on the 
concepts of separable class and it's centre, we propose to give 
the average case complexity analysis of PARALLEL-CAD. 
This algorithm still uses the projection operators defined for 
the sequential CAD algorithm. Our future work is to explore 



for, and define a new projection operator, which embodies 
the structure imposed by parallelism. This projection oper- 
ator, when found, may speed-up CAD further. 
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